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Abstract 

We formulate a simple edge generation rule based on an inverse like 
mass action principle for random graphs over a structured vertex set. We 
show that under very weak assumptions on this structure one obtains 
a scale free distribution for the degree. We furthermore introduce and 
study a "my friends are your friends" local search principle which makes 
the clustering coefficient large. 



1 Introduction 

Random graphs with a scale free distribution for the degree seem to appear very 
frequently in a great variety of real life situations like the Worldwide Web, the 
Internet, social networks, linguistic networks, citation networks and biochemical 
networks. An excellent recent survey is the Article by Albert and Barabasi [1]. 

Despite the large number of articles dealing with specific properties of such 
networks little is known about origin and formation principles of such graphs in 
the general framework. Most popular became the model of Albert and Barabasi 
[2] where graphs grow by successively adding a new vertex say x at each time 
step and forming edges between this new vertex and the already existing ones by 
assuming that the probability that x forms an edge with a vertex y whose degree 
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is k is about const ■ where N is the total number of vertices at that time. 
This gives a scale free graph although a very homogeneous one with little room 
for incorporating additional structures. The principle used in this construction, 
and the many variants of it, could be seen as a degree-mass-action principle 
since the degree acts here as a kind of positive affinity parameter for alignment 
of new vertices. Of course knowledge about the degree of a vertex is in real 
networks seldom available for other vertices. For instance in the social networks 
of friendships an individual certainly does not make the formation of a new 
friendship dependent of checking how many friends the potential new candidate 
already has (not to mention that it would be rather difficult to estimate this 
number). So we seek for an edge formation principle more related to a pre- 
given structure on the vertex set. We assume that this structure is specified 
by a real positive parameter u> € M + - like social importance, money or beauty- 
with a given probability distribution tp(u)). Furthermore we think of an edge 
between vertices x and y as the result of a directed choice made by either x or 
y (symbolized by x — > y or y — > x). This seems to be very reasonable since in 
many real life networks edges are formed that way. Note that although the edge 
creation is a directed process we consider the resulting graph as an undirected 
one since for the majority of relevant transmission processes on the network the 
original orientation of an edge is irrelevant. The crucial assumption about the 
pairing probability is now a kind of inverse mass-action-principle. Namely that 
the probability that a vertex x which makes a choice chooses a given vertex 
y with parameter ui (y) is proportional to [(p(w(y))] a ■ with some a > 
and N being the total number of vertices. Due to this principle it is not the 
actual value w of a vertex y which is relevant for the pairing probability but 
rather it's relative frequency of appearance in the "population". This is in a 
certain sense a market rule - the more rare a property is, the more attractive it 
becomes for others. Of course the property associated with lo has to be positive 
in perception. We think that this "Ansatz" captures the essence of many real 
life network formation processes. 

Instead of using a technical name for the principle like " inverse mass action" 
we want to call it the "Cameo-principle" having in mind the attractiveness, 
rareness and beauty of the small medallion with a profiled head in relief called 
Cameo. And it is exactly their rareness and beauty which gives them their high 
value. 

Our basic assumptions can be summarized as follows : 

• The parameter lo is independent identically distributed (i.i.d.) over the 
vertex set with a smooth monotone decreasing density function ip (u>) 

• Edges are formed by a sequence of choices. By a choice we mean that 
a vertex x chooses another vertex, say y , to form an edge between y 
and x. A vertex can make several choices. All choices are assumed to be 
independent of each other. 
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• If x makes a choice the probability of choosing y depends only on the 
relative density of ui (y) and is of the form 



• A pre-given outdegree distribution determines the number of choices made 
by the vertices. The total number of choices (and therefore the number of 
edges) is assumed to be about const- N. 

The striking observation under the above assumptions is now the emergence 
of a scale- free degree distribution independent of the choice of the lu— distribu- 
tion. Furthermore it can be shown that the exponent in the degree distribution 
becomes independent of ip (u) if the tail of ip decays faster then any power law. 
The precise formulation of the above statements is given in theorem 1 and 2 in 
the next section. 

2 Analysis of the model 

Let Vat = {1,...,N} be the vertex set of a random graph space. We are 
mainly interested in the asymptotic properties for N being very large. We 
assign i.i.d. to each element x from the set Vjv a continuous positive real 
random variable (r.v.) to (x) taken from a distribution with density function 
<p (to). The variable to can be interpreted as a parametrization of Vm- For a set 



Cuj q ,um = {x:lo(x)g [o>o,wi]} we obtain E ($C UOiUl ) — N- J ip(u)dw. Without 



loss of generality we always assume (p > on [0, oo). As a technical assumption 
we will furthermore need the following: 

Assumption 1: ip 6 C 2 ([0, oo)) and the second derivatives D 2 (<p>^) have no 
zeros for \fi\ £ (0,/io) and cu > loq (p) (this is just a monotonicity assumption 
on the tail of ip ). 

Edges are created by a directed process in which the basic events are choices 
made by the vertices. All choices are assumed to be i.i.d. The number of times a 
vertex x makes a choice is itself a random variable which may depend on x. We 
call this r.v. d out (x). The number of times a vertex x was chosen in the edge 
formation process is called the indegree di n (x) . Each choice generates a directed 
edge. We are mostly interested in the corresponding undirected graph. If we 
speak in the following about outdegree and indegree we refer just to the original 
direction in the edge formation process. Let p^ = Pr {x — > y \ to = to (y)} be 
the basic probability that a vertex y with a fixed value of to is chosen by x if x 
is about to make a choice. For a given realization £ of the r.v. to over Vat we 
assume: 



Vv{x-+y | u(y)} - [<p(w (y))} 



— <X 



■^;«e(0;i) 



(i) 



A(£,N) 1_ 
[<p(w)] a ■ N 



(2) 
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where a € (0, 1) and A (£, N) is a normalization constant. It is easy to see that 

oo 

the condition J [ip (u>)] ~ a duj < oo is necessary and sufficient to get A (£, N) — ► 
o 

A > for N — > oo where convergence is in the sense of probability. Therefore 
we need a < 1. One might argue that the choice probabilities should depend 
more explicitly on the actual realization £ of the r.v. oj over V/v -not only 
via the normalization constant. The reason not to do so is twofold. First it is 
mathematically unpleasant to work with the empirical distribution of oj induced 
by the realization £ since one had to use a somehow artificial N— dependent 
coarse graining. Second the empirical distribution is not really "observed" by 
the vertices (having in mind for instance individuals in a social network). What 
seems to be relevant is more the common believe about the distribution of u). 
In this sense our setting is a natural one. To keep the analysis simple we will 
first assume in section 2.1. that the r.v. d out (x) is constant and equal to k 
for all x. We will discuss the situation of a variable d out in section 2.2. Since 
we want to show that under the above conditions on ip we obtain a power law 
distribution for the degree we have to say a few words about the notion of 
power-law distribution in our framework. We say that a discrete probability 
distribution ip on N is of power law type with exponent j3 if 

^W = j^T3(i) ***>!■ (3) 

This means that our concept of power law distribution is an asymptotic one. 
We think that this is a very reasonable notion since in any empirical observation 
one can see only an approximation to a power law. 

Before starting with the proof of our claims we would like to mention that 
the emergence of a power law distribution in our setting is not so surprising as 
it might seem at the first look. The situation is best explained in form of an 
example. Let ip(u) = C ■ and define a new variable oj* — [^^p = 
. The new variable oj* can be seen as the effective parameter to which the 
vertex choice process applies. What is the induced distribution of oj* ? With 
F (z) = Pr {uj* < z} we obtain 

F(z)= [ ip (w) dw = -z^ - C (4) 



and therefore the oj*— distribution (b{uj*) = — ■ tt-t- This is a power law 

distribution with an exponent depending only on a. 

2.1 The case of constant outdegree ( d out (x) — k ) 
The main result of the paper is contained in the following 
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Theorem: 

i) Let tp (w) = ^3+ o(1) with /3 > 2. 

Then lim Pr {d (x) = fe I x G Vw} = -~r \ t ttt - 

ii) Let if (u>) fulfill the assumption 1. 
Then lim Pr{d (x) = k | x G Vat} = 



Remark: The theorem says, that under assumption 1 the degree distribu- 
tions for the finite random graphs of our model converge for N — > oo to a power 
law distribution in the sense of formula (3). 

Proof: Since by assumption the outdegree is constant and d (x) = di n (x) + 
dout (x) — di n {x) + ko asymptotically almost surely, we have to consider only the 
indegrce distribution. First we compute the expectation (in the limit N — > oo): 



fcn • A 

E (di„ (x)\u) = W (x)) = J^J^a (5) 



and therefore 



E (d in (x)) = k Q ■ A - J [ip (lo)} 1 -" duj . (6) 

For a x with fixed u> = oj (x) we get for the indegree distribution ip^ (k) = 
Pr {di n (x) = k | ui = uj (x)} a binomial distribution (here we count also multiple 
edges but the probability for their appearance vanishes for large N) 

fk N\ ( A (£, N) \ k ( A (£, N) \ koN - k 

which converges in the limit of large N to a Poisson distribution: 

V>„ (k) = C f e- C - (8) 

with Cuj = r^jjm • Finally we obtain for the unconditional limiting probability 
tp (k) = lim Pr (d,„ (x) = k) the expression 



N 

k 
k\ 



tp(k) = J <p(lj) §e" ' ■!/„• . (!)) 



The main contribution to V (&) comes from a rather small interval of ui -values 
- called / ess (k) - with the property that for loq G I ess (k) the expectation 
E(dj„ (x) | Ldo = ui (x)) is of order k. The rapid decay of the Poisson distri- 
bution will guarantee that the remaining parts of the integral become arbitrary 
small for large k. To fill in the details let 



(fc,7) 




' k -A 

^k + k^+t 



(10) 
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with 0<7<1. To estimate ip ess (k) = J <p> (u) jj-e Cuj dw we define a new 



variable s in the interval 
the integral into 

-fetr 

with 



byw^^- 1 and transform 



(f o (fi 



kg-A 
k+e 



2 



k -A 



ip tp 



1 



fc + e 



(11) 



(12) 



Here we used the relation Dip 1 (x) = [(Dtp) (tp> 1 (x))] 1 . Using Stirling's for- 
mula fc! = (27rfc)^ (^) fe (l + o (1)) and the approximation (l + f ) fe = e £ ~4fc +o(1) 
for |e| <C ki we get for the Poisson term in the integral 



~k 

%e- £ ' = (27Tfc)" 



k\ 

Since |e| <C k we obtain from formula (11) 

V'ess (fc) = 



e 2fc 



(13) 



(fco ■A)°(l|o(l)) 



/ 



-pop 



-1 / / fcp-A 



If the decay of </? is faster then any power-law we will show that 



— de(14) 



(27rfc)- 



w (V 1 f te 



(fc + e) 



o(l) 



(15) 



from which we can conclude that ipess (k) — 1+i 1 +o(1) holds. But first we will 
deal with the simpler case when ip is itself of power-law type. Let us assume 
that ip has an exact power-law tail of the form ^ . Therefore p^ 1 (u) = (§) ? 



and 



(Dp)^ 1 



kg- A 
k + e 



~l3C-{k -AY 
(k + e) o» 



(16) 



G 



and finally 



-ip o ip 



(Dp) (^((fcg)' 



(17) 



Since 



/ -,-de- l + o(l) (18) 

we get for the essential part of the degree distribution 

The correction term coming from the integration outside the essential domain 
can be estimated as follows 



Ipsmall (k) < Const ■ 



— de + / —de 



. - (27Tfc)5 / ( 2 7Tfc)- 



= °(i*Fs) ' <20) 
In the case when ip (w) is a power law in our weaker sense, say ip (w) = ^g+^rj 7 we 
need some regularity condition of the form Dp (ui) = 1+ g+ o(1 ) on the derivative 
to get an asymptotic power law for ip (k) . 

We continue now with the case when <p (u) decays faster then any power 
law, that is we assume 

p (ui) < -^-j for any I and uj > ui (I ) ■ (21) 

UJ 1 

To get the desired result we have to show that formula (15) holds. Since tp~ x (i) 
goes to infinity for uj — > 00 we have to show 

The last formula states that the negative logarithmic derivative of <p should not 
become to large or to small compared to ip respectively ^. For the following it 

is convenient to set p (x) = e^ 9 ^ with g (x) — > 00 and rewrite formula (22) as 

e - M fl(x) < _}_ < e M(x) for g ( 0; ) and x > a , ( ) _ (23) 

Dg (x) 
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Assume that formula (23) is not true with respect to the right hand side. Than 
we have for a sequence of values {xi} and open intervals Ii around the Xi and 
some function a (x) 

1 = e ra(x) a (x) and a (x) > 1 for x e h ■ (24) 
L>g (x) 

Integrating the last equation gives 

X 

e w(*) = e w(*o) + M f -±—dz . (25) 
J a(z) 



Since our assumption on D 



to be monotone for /i > and oj > ujq (/z) 



implies a (x) > 1 eventually we conclude that 

e"*' < e^ xa) + n(x-x ) . (26) 

But the fast decay condition for <p (x) translates into a growth condition for 
g{x) namely for all k 

g (x) > k log x; x> x (k) (27) 

which clearly contradicts formula (26). It remains to show that the left hand 
side of formula (23) also holds. Assuming the converse we get 

1 — = _L an d a(x)>l for x £ h (28) 



Dg (x) a (x) 

and after integration 



e ~ng(x) _ e -ra( 



X 

xa) - fj, j a (z) dz . (29) 



The monotonicity condition again implies a (x) > 1 eventually, hence 

e -M*) < e -«K*o) -^( x - Xo ) (30) 

and a clear contradiction since the right hand side becomes negative for large 
values of x. With this result in hand we get for the leading term 

Ass (k) = Mlg+g^ • f^A) ^ . e-^de 

ak 1+ ~ i \ k+e J (27Tfe)2 



k2 



-fe2 +T 
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For the minor integrals we get 

V small \ K ) — I a+ i. / /, 1\\ ,„ ,1 K 

i ^ fc + £ ) " (^)(^ 1 ((^) a )) i27Tk)2 ( 32 ) 



and 



(2 W fc)2 



To see that Ismail CO * s °f or der o ( fcl |i ) fix 7 > <5 > sumciently small and 
some k' such that 



fe+e y ) / t 

< — ) for fc > fc' . (34) 



With 



k a A 



C max (Jo) = max 

0<fe<fe' 



((¥)*)) j 



> < 00 (35) 



we obtain for V'ima/; CO the following estimation: 



C.U (*) < (^) " • Cmax (S) k'e-H-^') + 

+ I ~^+¥ W { 2,k)h d£ 



Therefore we obtain in the case when ip decays faster then any power law that 
the degree distribution ip (k) — 1 1 +o(1) is independent of ip. □ 

2.2 The case of nonconstant outdegree 

Next we deal with the case when the outdegree distribution is not constant. 
We will always assume that the expected outdegree is finite to keep the total 
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number of edges proportional to N. First we show that the indegree distribution 
is the same as in the constant degree case with ko replaced by E (d out (x) ) . With 
respect to the total degree distribution two situations can appear. Either the 
r.v. d ou t is independent of the r.v. lo or not. In the independent case one has to 
take then just the convolution of indegree and outdegree distribution to get the 
total degree distribution. The guiding principle here is, that the dominating 
distribution wins. If things arc not independent the analysis becomes quite 
more involved and we will sketch only the relevant steps dealing with a detailed 
investigation in a separate paper. 

We start with the independent case. Let 

q k =Pr {d out (x) = k} (37) 

and 



p k = Pv{d in (x) = k} (38) 

the asymptotic outdegree respectively indegree distributions. Since we assumed 
asymptotic independence we get for the total degree distribution dk the convo- 
lution 

dk= qi ' Pk ~ l ■ ( 39 ) 

0<i<fc 

We say that the tail of a distribution {pi} dominates the tail of another distribu- 
tion {qi} if qi = o(pi) . For the {pi} we assume further that it is an asymptotic 
power-law distribution with exponent 7 > 2. We estimate dk as follows: 

dk = Z) 9* • Pk-i + Qk-i ■ Pi 

0<i<# 0<i<# 



- E ft' (fc_ i )7 +0<1) + ^ > ° ((fe^Wl) • 
0<i<| v ' 0<i<f xv ' ' 

= k -r+o(i) S Qi' ( 1 _ ± y+o(.i) + ( fe 7+o<i) ) 12 ( 1 _±y+°W 'Pi ■ ( 40 ) 



0<i<| \ kj 0<i<_ 

Since both sums in the last expression are bounded and larger zero we obtain 

dk = £hW ■ ( 41 ) 

In the case when the outdegree distribution is the dominating one ( and in 
our case it has to be a power-law distribution) we can just reverse the above 
argument to show that the total degree distribution is in this situation the 
outdegree distribution. 

It remains to show that a variable outdegree will not affect the structure of 
the indegree distribution. The main difference in comparison to the previous 
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section is that formula (7) gets replaced by 



1pu (k) = 

¥ p r « otetlges ,,,.M 1 (^ ( ^)'( 1 -^)*(: 2) 

Since for M — const ■ N each binomial term still converges to a Poisson dis- 
tribution with parameter it is not difficult to show that in case when 
a limiting outdegree distribution exists and the first moment is finite one still 
obtains a limiting Poisson distribution where k is replaced by E (d out ) . 



2.3 The degree correlation and the clustering coefficient 

It is interesting to note that graphs formed by the "Cameo Principle" show a 
degree affinity which is observed in many real networks and which is also one of 
the underlying assumption in the Albert Barabasi model, namely 

k 

Pr {an edge starting in x ends up in a vertex y with d (y) = k} = — • const . 

N (43) 

To see that this is the case recall from the previous sections that the typical 
vertices with degree k have uu— values from I ess (k) for large k. Due to the 
definition of the directed edge probability we get for |e| < fcs+T 



Pr{x -> y | uj (y) e I ess (k)} 



^(^ 1 ((^) i )) 



JV 



A ± _ fc(l+o(l)) _ fc(l+o(l)) 
(tM.) ' N k N V d(x) 



= Pr{x ^ y \ d{y) = k} . (44) 

Therefore it seems to us, that if one observes in real networks edge affinities 
proportional to the degree one cannot conclude that the generation of the graph 
is also due to an edge formation process build up from this proportionality (like 
in the Albert-Barabasi model). 

We want to close this section with a short discussion of the clustering coeffi- 
cient. Since no explicit local clustering rule was implemented in our model one 
cannot expect a high clustering coefficient. Nevertheless it will be slightly larger 
then in the classical G (A, p) case. Again we will study here only the situation 
for constant outdegree fc . Remember that the clustering coefficient of a vertex 
x with degree k is defined as 

rM 2 ■ jj {(y, z) ■ y ~ z and y z £ Ai (a;)} 

c {x) = hj—T) (45) 



n 



where Ni (x) is the set of neighbors of x. The global clustering coefficient is then 
just the expectation E (C (x)) . In what follows we will only give an explicit com- 
putation for the number of edges in N\ (x) which is a little bit simpler to com- 
pute. Since the edge formation probabilities depend only on the "target" ver- 
tices, we get on Ni (x) the same uj— distribution as on the whole graph. In other 
words for any fixed x the probability that a y G Ni (x) has uj (y) > uj a is given by 

oo 

J tp (uj) dui. Therefore to compute E (Jj {(y, z) ; y ~ z and y, z <E Ni (x)} | d (x) = 

too 

= k) it is enough to compute the expected number of edges in a random vertex 
sample of given size k. It is easy to see that this number equals fc °^ -1 ^ . Since 
we know further that in the limit N — > oo the degree distribution for an x with 

uj (x) = w is the Poisson distribution ( k "_ ko y_ e~ c " with c u — ^"^a and k > k 
we get with T (uj) := E ((J {(y, z);y ~ z and y,z£]Vi (x)} | u) (x) = uj) 

1 \ w ) — Ar(fe-fco)! e 

k>ko 



E k a (m+k Q -l)c™ 
N-m\ e 



m>0 



fcoe^+fco(fc -l) 
N 



(46) 



and finally 



/■ fc , + fc (ko - 1) 

jT(cj) d cj = j N 0{ ° 3 duj (47) 

as the expected number of edges among the neighbors of a random chosen vertex 
x. Completely analogous one can obtains the asymptotics co ^ st for the expected 
clustering coefficient C. The explicit expression for the constant there is more 
cumbersome but straightforward. 



3 A growing network model 

In the above sections we described a model where the total number of vertices 
is large but fixed and all edges are generated at once simultaneously. Since real 
networks are typically evolving networks many models allow a growth process 
for the graphs. This is easy to incorporate in our situation too. For simplicity 
and since it is rather common today we use the evolution concept from Albert 
and Barabasi [2]. 

We start with an initial graph Go with No vertices and at each time t G N a 
fixed number Lq of new vertices with to- values taken again i.i.d. with density 
Lp (uj) . Each vertex forms at the time when it enters the graph, ko edges with 
the vertices already present. The edges are formed according to the "Cameo 
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Principle" that is 

Put (t) '■= Pr {x — > y | u) = uj (y) and x enters the graph at time i) = 

where £ t is the realization of the uj- values at time t, N t = tL + N and 
A(£ t ,N t ) a normalization constant. Let d(x,t,r) be the degree of x at time t 
and t (x) = t the time when x entered the graph. For x fixed and (>rwe 
have 

E(d in {x,t)) = \ - ■ — ~ logt-logr . 

f£[<p(w(x))] N t [p(u(x))] -L ( 49 ) 

We now investigate the asymptotic indegree distribution, namely the asymptotic 
probability that a randomly chosen vertex from the evolving graph at time t 
has indegree k. For simplicity we deal here only with the case Lq = 1. We first 
look at the indegree distribution ip^ (k,T,t) — Pr{d in (x,t) = k} for a fixed x 
with uj — uj (x) and r = r (x). Since 



^(k,T,t) = Pv{ Yl ly~* = fc} (50) 

k I/:t(i/)>t(x) 

with ly^ x being the independent r.v. of the indicator function that y formed an 
edge with x , one gets by a well known theorem in probability theory for t — r 
large a limiting Poisson distribution with expectation equal to E (dj„ {x,t)) (see 
[6] for recent sharp results in this direction) . In complete analogy to section 2 we 
obtain after integration over ui a scale free distribution for ifj (k, t, t)- the not on 
uj conditioned indegree distribution. Taking explicit care on the r dependence 
we get 

Note that o (1) does not depend on r for t — r 3> k. To obtain the final (not on 
t conditioned) distribution i\j (k, t) we just have to sum 



i/>(k,t) = - t Yj ^(k,r,t) 



0<T<t 



1 (fc ^[logf-logr])° 

0<r<t ft 
jfcl+i+o(l) ' (52) 
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In the last two equalities we have implicitly used the fact that the terms with 
t — t ~ const ■ k do not really contribute to the V (k,t) distribution and that 
J2 logr~tlogi-t. 

0<T<t 

4 A model variant with high clustering coeffi- 
cient (the "my friends are your friends"— prin- 
ciple) 

To get high clustering coefficients one has to incorporate local edge formation 
rules into the model like in the famous small world model of Watts and Strogatz 
[7] . A natural way to do this for our graphs is by introducing a kind of additional 
local choice rule: with probability p make the choice into the neighbors of your 
own neighbors at the given time and with probability q = 1 — p proceed like 
above. We call this rule the "my fiends are your friends" principle for obvious 
reasons. It should be a well known principle in social sciences since many social 
contacts are formed that way. We will give a detailed investigation of the types 
of graphs resulting from such processes in a forthcoming paper [4] . Here we will 
present only an informal discussion of some of the main properties. First it is 
important to note that the order in which the edges are created plays now a 
role (at least for the fine structure of local properties of the graphs). To fix the 
setting let us consider the following rule. The enumeration of the vertex set we 
will associate with the time at which the vertex will make a choice. The first 
choice each vertex makes is an independent one according to the rules specified 
in the previous paragraph. For the next choices any vertex x has two options 
- with probability q = 1 — p he proceeds as with the first choice (independent, 
global choice) or with probability p he chooses from iV 2 (x, t), the set of vertices 
at distance 2 from x at time t (dependent, local choice). We remark that the 
concrete form of ip (ui) is irrelevant for the further discussion. Two situations 
are now natural to consider: 

Case A: At an initial set of choices all vertices make their first, independent 
choice (here the order in which the choices are made does not matter). After 
that, each vertex makes it's second choice at the time corresponding to it's index 
number, then all vertices make their third choice and so on (to avoid a bias one 
should renumber randomly the vertex set after each sequence of choices) . 

Case B: Each vertex makes the remaining ko — 1 choices immediately after the 
first one. Again the time when a vertex is about to make the choices corresponds 
to it's index number. 

Let T (x) be the number of triangles in the final graph containing x. There 
are some trivial lower and upper bounds on the expectation of T (x) . First ob- 
serve that due to the bounded outdegree there cannot be more than (k — 1) d (x) 
edges among N\ (x) . On the other side any choice made by x according to the 
"my friends are your friends" -rule automatically generates a triangle contain- 
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ing x. Therefore K(T(x)) > p (ko — 1) and hence the correlation coefficient is 
independent of N asymptotically. We claim that for large d (x) the conditional 
expectation given the degree is approximately E (T (x) \ d (x)) ~ const ■ d (x) . 
We outline the reason for that for ko — 2 and the situation in case B. Note that 
the rule from case B is also the canonical one if one wants to deal with growing 
networks. 

Let d\ n (x) be the indegree of x after each vertex made the first choice. 
Denote further by iV 1 (x) the set of vertices at distance I from x after the first 
choices are made. Since the first choices are independent, a vertex x is not part of 
a triangle almost surely. In the second round of choices triangular configurations 
will be formed frequently. Since we assumed that d (x) is large we can assume 
that d\ n (x) is large too. There are two mechanisms which contribute to T (x) 
when the second choices are made. First all vertices in Nl (x) create an edge 
into TV; 1 (x) with probability p and since | Nl (x) | = d\ n (x) + 1 is assumed to be 
large we can neglect the events that multiple edges are created. So we expect 
p(dj n (x) + 1) triangles containing x from this process. Second any vertex z from 
Nl (x) can chose x with probability p^^yj an d therefore create a new triangle. 

Furthermore for most vertices z G Nl (x) we have 

\Nl(x)\~dl(x)- const -\Nl(z)\ (53) 

from which we get an estimation of order pd\ n (x) • const for the expected number 
of triangles generated via this process. Combining this estimations and using 
the fact that the number of independent choices pointing to x in the second 
choice sequence is about (1 — p) d\ n (x) we obtain 

d {x) ~ d] n (x) + (1 - p) d\ n (x) + p(d] n (x) + 1) + pd\ n (x) • const 

~ d] n (x) (2 + p ■ const) (54) 

which gives for the expected number of triangles 

^'"w-Itf^m ■ ,55) 

The above considerations are of course of heuristic type but can easily made 
be precise. The const appearing in the above formulas can as well be explicitly 
computed. We remark finally that the above considerations remain true if the 
outdegree distribution is not a constant and that the clustering mechanism does 
not change the asymptotic shape of the total degree distribution. We plan to 
discuss all this aspects in much more detail in a paper in preparation. 

5 Conclusions and outlook 

We have shown in this article that graphs with a scale-free degree distribution 
appear naturally as the result of a simple edge formation rule based on choices 
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where the probability to chose a vertex with affinity parameter u> is proportional 
to the frequency of appearance of that parameter. If the affinity parameter u> 
is itself power law like distributed one could also use a direct proportionality 
to the value u> to get still a scale free graph since ip (u) is itself a polynomial 
relation (this explains the observation reported in [5]). 

We have not tried in this paper to give estimations on the error terms for the 
asymptotics. Numerical simulations, which will be reported elsewhere, indicate 
that already for values of k of order const ■ fc and vertex sets of size N w 10 5 
one has a very good agreement with the asymptotic result. Of course in all this 
finite size effects the concrete form of ip really matters. 

It seems natural to investigate models where ip (w) has a singular, for instance 
discrete, support. Such a situation is common when one couples for instance the 
choice probabilities directly to the outdegree (see [3] for a specific model of such 
type). Some care is necessary in that case. For instance if one takes a discrete 
integer valued random variable k with the distribution ip (k) = const ■ exp (— k) 
and proceeds like in the case of the continuous variable u> it will be not longer 
true that the asymptotic degree distribution is a power-law distribution. The 
reason behind this is the following. The expected indegree of vertices x and y 
with k (x) = kg and k(y) = fco + 1 differs by a constant factor but since the 
indegree of x and y is concentrated in an interval of length square root of the 
expectation around the expectation there appears a gap in the degree distribu- 
tion of the whole graph. If ip (k) is a power law distribution with exponent (3 
the same remains true for a > | whereas for a < j| a similar proof with the 
same results as in the continuous case can be given. In general one needs for 
singular ip (lo) an overlap condition for the involved Poisson distributions to get 
a power law tail distribution. 

With respect to the clustering mechanism described in section 4 there are 
many interesting questions. What is the effect on diameter, average path-length 
and spectral properties? 
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paper. 
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